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Abstract 



Indeed, these are exciting times. We are in the heart of a digital renaissance. Automation 
and computer technology allow engineers and scientists to fabricate processes that amalgamate 
quality of life. We anticipate much growth in medical image interpretation and understanding. 



t/3 ' due to the influx of computer technologies. This work should serve as a guide to introduce 

^ the reader to core themes in theoretical computer science, as well as imaging applications for 

understanding vocal-fold vibrations. In this work, we motivate the use of automation and 
review some mathematical models of computation. We present a proof of a classical problem 
in image analysis that cannot be automated by means of algorithms. Furthermore, discuss 
some applications for processing medical images of the vocal folds, and discuss some of the 
^j>^ ' exhilarating directions the art of automation will take vocal-fold image interpretation and quite 

\^ , possibly other areas of biomedical image analysis. 
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^ '. 1 Introduction 



Although more recently it has been recognized as its own discipline, the fruits of Biomedical Engi- 
neering (BME) have long been apparent, and will continue to revolutionize the way human beings 
understand and administer medicine. As a multi-disciplinary field, BME draws upon the exper- 
tise of Biology, Chemistry, Engineering, Mathematics, Medicine, Physics, Physiology, Probability 
and Statistics, and several others. Advancements in computer technologies and imaging modalities 
over the last several decades have attributed to the wide spread growth of medical image analy- 
sis, an application central to BME |25] . Semiautomated algorithms will pave the way to better 
understanding of medical image analysis and help shape automated biomedical imaging. 

In the era of big data and information processing, we are in the heart of a digital renaissance. 
Despite the obvious power of the computer technology available today, there exist many inherent 
limitations that dramatically affect the types of software and methodology available for medical 
image analysis. Furthermore, several researchers have posed general questions regarding the efficacy 
of automated systems. For example, trust, workload, and risk influence the adoption of automated 
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systems, while individual differences make predicting success of these systems difficult, and false 
alarms lead to underutilization of the automated systems j2?j . 

In medical image analysis, automation should serve as a tool to aid clinicians, pathologists, 
physicians, and surgeons, not serve as a replacement. Deployment of robust automated systems 
takes several years of research and development, and several validation studies to ensure the quality 
of the automated work. For example, the air traffic control, railroad systems, and several areas of 
manufacturing have been successful in harvesting the fruits of automation. However, in medical 
image analysis the fruits of automation are carried by quite a different tree, due to the very nature 
of each specialist's function along the medicinal pipeline. 

Researchers have proposed breaking down automated functions into four classes (1) information 
acquisition, (2) information analysis, (3) decision and action selection, and (4) action implemen- 
tation [28]. From our experience, categories (2) information analysis and (3) decision and action 
selection, for the time being, are difficult to achieve without human interaction in medical image 
analysis, and we expect to see them evolve and grow in the coming years. Category (2) requires 
cognitive capabilities such as working memory and inferential processes, while (3) requires decisions 
yielding different outcomes while having insight on the consequences of such actions [28] . 

Engineers seeking to automate systems must proceed carefully as automation has been shown to 
alter human behavior, since automation can make it difficult for the automator and/or supervisor 
to predict or anticipate outcomes of scenarios [271 128} 129] . Furthermore, an additional factor to 
consider when employing automated systems is the effect the automation will have on human 
decision making [32]. Automated agents are already becoming viable teammates to humans in 
the workforce. Although humans naturally behave socially to their digital counterparts, human- 
machine trust deteriorates faster than human- human trust, but can be greatly reduced when human 
users and supervisors better understand the workings of their automated teammates [20] , 

2 The Art of Automation 

In this section, we provide the reader with a brief overview of two well-studied abstract machines, 
models of computation. Finite automata (limited memory computers) and Turing machines, which 
are more representative of the computers we use in every day life. To make the illustration of 
the concepts in sections 2 and 3 more tangible, we have developed a software package containing 
simulated examples allowing the reader to follow along at home. See: ht tps : //github . com/ 
[habisof t/ for a listing of our software repository corresponding to this paper. 

2.1 Automata Theory at a Glance 

Finite Automata are mathematical models of simple computation. Despite their limited amounts of 
memory, they play a key role in several practical areas of computer science, such as, compiler design, 
command line and interpreter design, programming language development, hardware design, and 
other various text processing applications [3T\ 117] . Finite automata have long been studied for 
their ability to generate and recognize regular languages. Regular languages have several special 
properties, such as, their closure properties, that is, the resulting language is also a regular language 
when performing boolean operations like complementation, intersection, and union on the strings 
of the language in question. Furthermore, when performing regular operations like concatenation, 
Kleene star, and string reversal, the resulting string will still be members of a regular language. 

Mechanically speaking, finite automata are modeled as a reading head and a finite length input 
tape. The reading head of a finite automaton contains a finite number of states for defining its 
control logic, and can only move from left to right while scanning across its input tape, reading one 
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input symbol at a time. Depending on the sequence of input symbols read, and the finite automa- 
ton's transition function, the automaton will transition to different states. A finite automaton is 
not allowed to write to its input tape, this can be viewed as read-only memory (ROM). On any 
given input string, a finite automaton will accept the string if the string is a member of the language 
that the automaton recognizes or reject the string if the string is not a member of the language the 
automaton recognizes. Fig. 1 illustrates a mechanical diagram of a finite automaton. 



Figure 1: Mechanical diagram of a finite automaton. Reading one element at a time from a finite 
length tape, the reading head moves from left to right. 

Suppose we have a finite alphabet of input symbols S = {a, b, c, . . . , z. A, B, C, . . . , Z}, that 
is, the alphabet of lower case letters a through z and upper case letters A through Z from the 
English alphabet. Note, by definition, all strings from S are of finite length |3H I17j. Motivated by 
Cornell University's library of e-print articles at |arxiv . orgjin the Computing Research Repository 
(CoRR), we devise the following regular expression (REX) that generates the language of all strings 
that contain at least one occurrence of the substring CoRR is given by Eq. (1) 



Crex = S*(CoRR)+S*. 



(1) 



In Eq. (1), the S*, is an application of the Kleene star operation over the alphabet S, and denotes 
to oo many strings from the alphabet S. The term (CoRR)"*" represents 1 to oo repetitions of 
the string CoRR, and is an example using the Kleene plus operation over the string CoRR. Now, 
in plain English, the regular expression in Eq. (1) can be read as any string (including the empty 
string), followed by the string CoRR (at least once), followed by any string (including the empty 
string). Let this language be called substring CoRR. 

Fig. 2 is a deterministic finite automaton (DFA) D that recognizes the language substring 
CoRR, which we denote as L{D) = Crex, that is, the language of D is Crex- 



E\{C} 




E\{R} 



Figure 2: A Deterministic finite automaton D, such that L{D) = CreX; that is, D recognizes the 
language substring CoRR, with accepting paths in bold. 

We illustrate DFAs as directed graphs. When DFA D is in state qo and it reads a C, the DFA 
will transition into state qi, when in state qi, if an o is read, the DFA will transition into state 
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^2, SO on and so forth. Upon reading C and o, and a total of two Rs are read, the DFA wih be 
win in state (74, the accepting state. The accepting state is denoted as a circle within a circle. The 
attribute that makes an automaton deterministic is the fact that a single transition (edge) is given 
for each input symbol of the alphabet S for each state of the automaton. In Fig. 2, for simplicity, 
we do not draw a transition for each lower case and capital letter of the English alphabet, rather 
we write S \ {a particular symbol}. For example, state qq has a reflexive edge that points right 
back into state qo with the transition labeled S \ {C}, which is read as the entire alphabet except 
for C. Likewise, the edge from qi to qo is read as any symbol in the alphabet E, except for C and o. 

2.2 Computability Theory at a Glance 

Similar to a finite automaton, a Turing Machine (TM), first introduced by Alan Turing in the late 
1930's as the a-machine, is a model of computation [34]. Like the automaton in Fig. 1, a TM has 
a reading head and an input tape. Unlike finite automata, the TM's reading head is a read/write 
head, and the input tape is of infinite length. Furthermore, the TM's reading head can move left 
or right as determined by the device's transition function. Unlike finite automata, a TM has a 
separate start and accept states; they take place immediately. These subtle differences between 
finite automata and TMs creates a world of difference in terms of the types of languages these 
devices can recognize. 

On any input string, a Turing machine will accept, reject, or loop. Accepting and rejecting 
are halting behaviors, the TM will start its simulation and stop after a finite number of steps. A 
language is Turing- decidahle or recursive if some Turing machine halts and accepts every string 
in the set or halts and rejects for non members. A language is Turing-recognizable or recursively- 
enumerable if some TM enumerates the strings in a language, but does not necessarily halt. We note, 
the computers we employ in everyday life do not have infinite amounts of memory, however, the 
amount of memory available is so large, that modeling a computer as finite automata is inaccurate 
and counterproductive ^2]. In fact, our intuitive notion of algorithms for the computers we use 
everyday are equivalent to Turing machine programs [31j. 

We use set notation to denote sets of strings that obey some property, we call these sets lan- 
guages. The following is the set theoretic notation for the language CreX; the language of strings 
containing at least one occurrence of CoRR. 

Crex = {w\w is a string that contains the substring CoRR }. 

Below, the TM [/ is a decider for the language Crex- U is an example of the universal Turing ma- 
chine and is capable of simulating finite automatons and other Turing machines. In this particular 
example, U will simulate the DFA D on the input string w, in this case, D can be thought of as a 
subroutine or function in regards to modern programming. 

[/ = " on input < D,w >, where D is a string encoding the 
description DFA D and w is a string: 
(1.) Simulate DFA D on the input string w. 
(2.) If D's simulation ends in an accepting state, accept, 
if the simulation ends in a non accepting state, reject. " 
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2.3 On Computability and Countability 

The very framework of computability and to a large degree most of discrete mathematics are built 
upon set theory and countability of sets. One interesting result of computability is that the set 
of all strings S* can be put into correspondence with the set of natural numbers N, therefore, 3 
a bijection z : S* — )• N, a function that maps every element of S* to every element of N. For 
example, suppose our finite alphabet is the binary alphabet S = {0, 1}. Our bijection z : S* — )■ N 
goes as follows, in lexicographical order list all the strings in S*, that is, in increasing order, list 
all strings of length 0, then all strings of length 1, then all strings of length 2, so on and so forth. 
While listing all strings in lexicographical order list all the numbers in N. As the elements of both 
sets are being enumerated, pair the current element of S* with the current element in N. Below, a 
visual representation of the bijection z : S* ^ N is given, where e is the empty string. 



N 



{0, 



0, 
1, 



00, 
3, 



01, 

4, 



10, 

5, 



11, 
6, 



000, 
7, 



•••} 
...} 



Via the sets enumerated above, each column in S* is aligned to a corresponding column in N, this is 
our bijection, clearly the set of all strings and the set of natural numbers are both countably infinite, 
therefore, the sets are the same size or cardinality, denoted = |N|. In fact, bijective functions 
are computable maps, functions that are decidable by some TM in a polynomial number of steps 
with respect to the length of the given input. Georg Cantor, arguably the father of elementary set 
theory, spent a major portion of his career working on the cardinality of various infinite sets and 
published many works on the topic in the late 1800's. One interesting problem Cantor studied was 
the cardinality of the rational fractions Q = |m,n S N}. Oddly enough, |Q| = |N|. 

Lemma 2.1. 3 a bijection y : Q — >■ N .'. |Q| = |N|, today the bijection is known as Cantor's 
diagonalization argument \3T\ I12j . we will not prove it here, however, in the proof of the next 
lemma the same technique is applied. 

Lemma 2.2. Proof: 3 a bijection / : N x N N 

In order to demonstrate the bijection from the Cartesian product of the natural numbers with 
the natural numbers we must pair each element from N x N with a unique element of N. We begin 
by listing all possible pairs of N x N in an infinite table, which is given below. 



■(0,0)o 
(l,0)i 
(2,0)3 
(3,0)6 
(4,0)io 



(0,1)2 

(1,1)4 

(2,1)7 

(3,l)ii 

(4,1) 



(0,2)5 

(1,2)8 

(2,2)i2 

(3,2) 

(4,2) 



(0,3)9 

(l,3)l3 

(2,3) 
(3,3) 
(4,3) 



(0,4)i4 

(1,4) 

(2,4) 

(3,4) 

(4,4) 



Notice in the table above, we use subscript notation to demonstrate the element of N that each 
ordered pair from N x N maps to. The bijection works by starting at element (0, 0) and then 
simply counting up along all the subsequent diagonals after the first diagonal starting at (0,0). 
More specifically, the bijection /, has the following closed formula: 



f{a,b) 



(a + 6)(a + 6 + 1) 



+ b. 



The counting scheme presented in this lemma is a variation of the diagonalization argument, also 
known as Cantor's pairing function, thus, we have successfully illustrated the existence of a bijection 
/ : N X N ^ N .-. |N X NI = |N| □ 
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Bijective functions have the special property of being invertible functions. The concept of 
invertibihty has deep imphcations, thus one problem instance (language) may be casted to another 
more malleable set in a finite number of steps. Another result of set theory is the notion of a set's 
complement, which is a set of everything that is not contained in the set in question. For example, 
the complement of set A is denoted as A, and would contain every element not found in A. Another 
example would be the complement of N, the real numbers, therefore, N = M, these are decimal 
numbers that are not necessarily rational and in fact, M is an infinite set of infinite sets. Another 
interesting result where computability meets countability is that the set of all TMs is countably 
infinite, thus |TM| = [N|, all TMs can be put in correspondence with the set of natural numbers 
|3H 112] . Yet, this also has deep implications in that there are more languages (problems) than 
there are Turing machines, this in turn implies that there are some languages that are not even 
Turing-recognizable, which in turn brings us to the following lemma. 

Lemma 2.3. Proof: A language L is decidable if and only if it is Turing-recognizable and its 
complement L is also Turing-recognizable (co- Turing-recognizable) |31j . 

=^ if L is decidable, both L and L are Turing-recognizable. 

Any decidable language L is by default Turing-recognizable, and given the closure property of 
the set of decidable languages under complementation, the complement L is also decidable. There- 
fore, if L is decidable, then the decidability and recognizability of L is trivially true. 

<^ if both L and L are Turing-recognizable. 

For this direction, we let the TM Mi be the recognizer for L and the TM M2 be the recognizer 
for L. Now, we construct the TM M that decides L. 

M = "on input w, where w is a string: 

(1.) Run both Mi and M2 on input w parallel. 
(2.) If Ml accepts, accept; If M2 accepts, reject. " 

Running the TMs Mi and M2 in parallel can be achieved by simulating each machine's input tape 
on a two-tape TM that toggles between simulating a step on Mi and then simulating a step on 
M2 until either machines accept. All strings w will be either in L or L. Therefore, Mi or M2 will 
accept w at some point during their simulation. M accepts all strings in L and rejects all strings 
in L. Therefore, M will always halt, making it a decider for L. □ 

Despite the obvious power of our computers, there are limitations on the types of problems 
algorithms can solve, and they should not be taken lightly. We live in a fast-paced society where 
time and money dominate the decisions we make in our waking lives, therefore, having a firm un- 
derstanding of computationally infeasible problems can save time, money, and the stresses induced 
by exhausting either without some sort of gain. 

3 A Simple Energy Optimization Problem in Image Analysis 

In computer vision and image processing related tasks, segmentation is a processing step that 
aims to separate or extract features of interest from image data. In many medical image analysis 
problems, segmentation is a preliminary step that must be performed before measurements and 
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additional relationships can be determined. Some segmentation methods employ thresholding of 
intensity information, some take advantage of edge information, while some methods even use com- 
putational geometry to incorporate prior shape information. One popular tool in image processing 
is the active contour or snake model, which is capable of incorporating edge, intensity, and prior 
shape information to segment and extract features of interest from digital images by deforming a 
contour towards salient image features |15j . 

Since their introduction, several other various deformable models have been devised including 
level set and curve evolution algorithms that exploit intensity information by evolving a contour 
using the inherent topology information specified by the change of intensity within an image [26117]. 
Some snake models even treat the contour of interest as foreground information contained within 
the domain of a closed-loop curve being segmented from the rest of the image, which is treated as 
background information, by considering the behavior of the intensity information contained within 
the closed curve [8]. While other methods choose to deform a finite element model that specifies 
the contour of interest [9]. We focus on classical snake theory as its roots extend into modern 
renditions of these concepts. 



3.1 Snake-energy Optimization 

Our concern is discrete sets of pixels and the steps necessary to partition the data into meaningful 
regions. Let / be a 2D gray scale image with two dimensions corresponding to the width and height 
of the image, and one dimension corresponding to the intensity, such that / C N x N x N, where the 
values of each dimension are bounded by the natural numbers. A snake s is a spline bounded by 
the image plane, therefore s G /. Over the years, the series of vertices that comprise the snake have 
been termed snaxels, a short hand for snake elements. We denote the length of our snake as |s| = n, 
where s has n snaxels. In classical snake theory, a snake will satisfy the following force-balance 
condition fint + fext = 0, where fint is an internal force, the result of the shape of the snake, and 
fext is an external potential-force field derived from image data. Until the force-balance condition 
is met, our snake s is repeatedly deformed to minimize an energy functional of the following form 

n 

^:nake(s) = t^-^^^^) + ^ext(Si)] , (2) 
1=1 

where the internal force provided by our snake's shape is, 

1 



a • |sj - Sj_ip + /? • |sj+i - Sj + Si_i|^ , (3) 
and the external potential provided by our image-data term is 

^^ext(si) = -7- |V/(s,)|^ (4) 

Our snake s is a controlled-continuity spline, that behaves like a rigid rubber band structure. In 
Eq. (3), a and /3 are parameters that influence the continuity and rigidity, respectively. And Eq 
(4) is simply the magnitude of the image gradient where 7 is a weighting parameter for adjusting 
the strength or importance of the data term. Furthermore, several authors have improved the 
variational snake-deformation algorithm by devising external force fields that improve snake con- 
vergence even when the snake is far from the features of interest, by modeling the external potential 
as specific kinds of physical processes in time [IHl |6l HI] . 

Variational approaches make use of higher order derivatives in order to compute internal and 
external forces acting upon a snake [15]. Yet, our data is discrete; in variational approaches. 
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all is fine during snake-deformation until we wish to employ hard- constraints on our contour of 
interest. Hard-constraints in optimization problems are those which force a solution, in our case, 
our snake, to obey some desired behavior. Under hard-constraints, variational approaches become 
numerically unstable. For example, a hard-constraint could be the specification of a desired distance 
between the snaxels that comprise a snake [3]. Furthermore, upon each successful iteration of snake- 
deformation, the external force acting upon a snake will exhibit minor perturbations as decimals 
values are influenced by noise and fluctuations in the computer memory [3l |36] . 

We note an example of a variational snake-deformation algorithm that should be able to over- 
come the pitfalls of classical snake-deformation approach, namely the numerical instability under 
hard constraints [18]. The method treats the series of snaxel's that comprise a snake as a set of 
masses connected by spline segments, that are accelerated towards image features. For each snaxel, 
an appropriate mass and dampening force can be applied, however, in our experience, selection of 
parameters that lead to ideal convergence can be difficult and the snake deformation can be time 
consuming. We wish to employ the discrete dynamic programming algorithm outlined in [3] as the 
kernel of the algorithms in our discussion on snake-energy optimization. 

Dynamic programming shares an interesting relationship with variational problems. It works 
by decomposing the system into a series of overlapping subproblems and enables us to optimize 
the cost of the variational system over a single dimension by treating a variational problem as a 
discrete multi-stage decision making process. When applying dynamic programming, convergence 
is guaranteed, since the number of possible states is a monotonically decreasing function with 
respect to an increase in iterations. Dynamic programming is capable of bypassing local minima. 
With regards to snake-energy minimization, such algorithms have these additional advantages: 
numerical stability due to not employing higher order derivatives in assessing the cost function, 
and guarantees the globally optimal snake-deformation and energy minimization for the given set 
of parameters. For more details, refer to [3l U]. Furthermore, in our experience, the dynamic 
programming approach is less sensitive to choice snake parameters in comparison to its variational 
counterparts. 

3.2 Application of Snakes in Laryngeal Imaging 

Here we provide the reader with some examples of snakes being used for medical image feature 
extraction, namely in regards to laryngeal high-speed videoendoscopy, high-speed videos of vocal- 
fold vibrations [10]. Kymography, with respect to vocal-fold {laryngeal) imaging, is the use of 
high-speed line-scan images across the left and right vocal fold [33j. Typically, segmentation of the 
glottis, the anatomical structure formed between the vocal folds as air separates them, is performed 
to determine additional characteristics such as measuring the left and right vocal folds. Several 
methods exist in image processing for segmenting features of interest from images, however, methods 
relying purely on image intensity information can result in imperfections and discontinuities in the 
extracted features. In biomedical imaging, structures of interest are subtle and can vary significantly 
from subject to subject, and it is for this reason many researchers choose to employ snakes or similar 
smooth contour delineation techniques, since this type of processing imposes a degree of smoothness 
on the extracted features. 

Laryngeal imaging differs significantly from other types of medical images since the phenomenon 
of interest is non-stationary and is rapidly changing. Vocal fold vibrations are fast, complex and 
fine. Several researchers \22 \ \T\ [T9 \ I2 H [T^. plus many others, have used snakes for laryngeal image 
analysis and its applications. Our goal here is not to provide a quantitative evaluation of such 
techniques, rather it is to introduce the reader to application of such image processing tools for 
medical image interpretation with respect to such laryngeal images. Fig. 3 shows a pair of temporal 
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snakes being deformed within the space-time domain of kymographic image sequences derived from 
laryngeal high speed video data [21]. 




Figure 3: Starting from the middle of the glottis along all glottal regions within the kymographic 
image, a pair of temporal snakes being deformed on a kymographic image. 

Once the temporal-snake transform has been applied to all of the kymographic images containing 
the vocal folds, the resulting contours can be trivially remapped to the spatial domain of the 
recording as delineated left and right vocal folds shown in Fig. 4. Note, in healthy vocal fold 
vibration, the vocal folds are restricted to moving left and right. 




frame = 146 frame = 156 frame = 166 







1 





frame = 176 frame = 186 



Figure 4: A subject's left and right vocal folds delineated using the temporal-snake transform, 
with white contours imposed on the vocal fold boundaries. This particular sequence shows the 
transition from a closed to an opened glottis. The horizontal white line across the vocal folds has 
been superimposed to demonstrate the approximate middle kymographic scan line between the top 
and bottom of the vocal folds. The brightness in these images has been increased for printing. 
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3.3 A Sncike in the Turing Machine's Shadow (Contribution 1) 



In the previous subsection, we introduced snakes as a computerized spline deformed in the spa- 
tial domain of a digital image in order to minimize energy and result in feature extraction. Our 
arguments in this section are closely related to the snake model. It is our belief that such ar- 
guments may hold for many other computational models in image analysis and merits continued 
investigation. Before a snake can be deformed to minimize its energy, it must have a proper ini- 
tialization. Although in practice snake initialization can be tricky and often times not well defined, 
we ignore the need of initialization in the proof of the following theorem in order to simplify the 
proof construction, since it can easily be shown that there are an exponential number of legal 
configurations the snaxels of a snake can be in before applying the snake-deformation algorithm. 
This does not detract from the general argument, rather it makes it more accessible to the reader. 
In practice, snake-initialization is typically achieved via information granted from a higher level 
process, operator specification, or some other computable function as defined by the application. 

Recall, traditionally, a snake will have at least three parameters, a a weight on the spline's 
continuity, /3 a weight on the spline's rigidity or difficulty in bending, and 7 a weight governing the 
strength of or how reliable the data term is. The fact of the matter is, given parameters, a snake's 
energy can be minimized, however, the problem of snake-parameter selection is a computationally 
intractable problem. This is not to say that an approximation, or some heuristic or empirical 
human made observations on the optimal parameters or optimal parameter range can be devised 
for a particular data set or even data type, rather, there is no way to define the optimal parameters 
via an algorithm, even if an ideal initialization is always available. 

Theorem 3.1. Proof: The language of snake parameters is undecidable. 
Recall, the language of snake parameters is given by the following set 



Now, we must construct a bijection g':NxNxN— >-N. We could construct another multidimensional 
table and enumerate all entries of N x N x N along with all the entries of N to pair the elements, 
however, we can take a more direct approach by taking advantage of function composition to derive 
a closed form formula. Using the bijection / established in lemma 2.2, the formula for the bijection 
g{a, b,c) = d is as follows: 



P = {a,/3,7 I a,/?,7GQ}, 
that is, the language of triplets of rational fractions. 



y p E P, 3 x,y, z E Q, such that p = {x, y, x} 



^ p G {Q X Q X Q} 



^ P = {Q X Q X Q} 
via Lemma 2.1, a bijection can be used to show |Q| = |N|, 



^ P = {N X N X N} 



g{a, b, c 



■) = f{aj{b,c)) 
_ {a + f{b,c)){a + f{b,c) + l) 



2 



+ f{b,c), 
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thus, a bijection gi:NxNxN— >N has been estabUshed, which in turn illustrates |P| = |N|. 
Now, via lemma 2.3, a language and its complement must both be Turing-recognizable in order for 
the language to be decidable. To summarize, in the case of the language of snake parameters, the 
language itself can be reduced to the natural numbers, however, the complement of this set, the 
real numbers, is not even Turing recognizable! 

the language of snake parameters P is undecidable. □ 

Again, it is quite possible to determine some heuristic or employ empirical observations on the 
parameters or set of parameters that lead a snake to ideal convergence on a sequence of images, 
however, it is impossible to write a computable function to calculate optimal snake parameters, no 
such function exists. 

Corollary 3.2. by Theorem 3.1, automated snake algorithms that require computing updated 
snake parameters are intractable by means of deterministic computation. 

In practice, when dealing with image sequences, there arise occasions in which the same parameters 
cannot be used for all snakes we wish to deform within the series. Especially when images exhibit 
sharp changes in brightness or contrast, or cases in which a snake is in regions of strong noise 
and a lack of noise simultaneously. Wc note that the dynamic programming algorithm for snake 
deformation does allow adjusting a, (3, and 7 on a per-snaxel basis, however, the problem of selecting 
parameters on a per-snaxel basis is still undecidable. 

4 Discussion 

In the previous section, we proved theorem 3.1, the language of snake parameters is not Turing- 
decidable, given a particular snake instance, suitable parameters cannot be determined algorithmi- 
cally. However, this result should not come as a big surprise since such models in image analysis 
employ variational frameworks for regularizing a solution, to make ill-posed problems more manage- 
able. In constraint optimization problems, hard- constraints can significantly reduce the cardinality 
of the set of possible solutions, since all solutions will have to conform to a particular structure. 
Whereas, soft- constraints are constraints only on the cost function employed for the optimization 
process. In regards to snake models, the parameters a and /? for specifying a contour's internal 
energy are regularization terms, in fact, they are soft-constraints, they do not convey any additional 
information that all solutions must obey, rather information regarding the solution for a specific 
instance of the problem. The parameters have been introduced to make the energy optimization 
solvable in the first place. 

Thus, the difficulty in solving the ill-posed energy minimization problem has been casted from 
the problem of even being able to find a solution to a particular problem (by introducing regular- 
ization parameters), to the problem of finding suitable parameters to be used in finding a solution 
to a particular problem instance, which in turn implies, herein lies the problem! In general, it is 
the non-uniqueness of the minimal energy contour produced by algorithms such as the snake model 
that make these problems difficult to automate, especially in the context of sharp variations or 
changes when traversing image sequences. 

4.1 A Human Being is a Deciding Being 

Although fully automated biomedical interpretation is desired, it is a long term goal. Furthermore, 
semiautomated methods may in fact prove to be more useful until a general mathematical theory 
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explaining human perception and understanding of imagery has been developed, or the problem 
of simulating human intelligence via computer has been solved. A skilled professional such as a 
clinician or surgeon could take advantage of semiautomated methods that can potentially auto- 
mate a vast majority of the monotonous and redundant tasks in image interpretation, rather than 
the automation acting as a second opinion or a replacement. With semiautomated image analysis 
techniques, the user of a software tool is able to adjust the process until they obtain a desirable 
result. One of the co-creators of the original snake model has gone through great effort to sum- 
marize the different types of snake and deformable models for medical image analysis, concluding 
semiautomatic and interactive methods will remain dominant in practices for years to come |23j . 

We wish to take the argument a step further. As semiautomated methods for biomedical image 
analysis dominate the field, these methods will be the key to designing fully automated methods, 
and may even serve as a gateway to developing simulations of human-like intelligence via computer. 
In the case of the temporal-snake transform [24] . after a few attempts a user would converge to the 
set of parameters that yield the best result for a particular image sequence. This should not come 
as a surprise as humans are undoubtedly great deciders of computationally hard problems. 

Let us digress for a moment and take note that supervised learning and semi-supervised learning 
algorithms, are methods which take advantage of data labeled by a human that is representative 
of the phenomenon of interest, by employing variational techniques for performing classification or 
optimizing parameters that yield desirable results [13]. Such training algorithms not only require 
human interaction to produce samples of what is desired by the analysis, but tuning the parameters 
again can be quite time consuming, and also require large amounts of time for off-line training. 
Furthermore, such methods may require retraining upon the introduction of new features and or 
data that deviates greatly from the anticipated input, and suffer when the data contains large 
amounts of redundant information. 

In recent years, unsupervised and deep learning algorithms have gained substantial amounts 
attention, due to the desire of determining relationships in big data and to the widespread use of 
the internet and search engines. Deep learning methods are often casted as off-line unsupervised 
machine learning problems which seeks to determine the global optimum in pattern classification 
without human interaction, by taking advantage of the statistical power of extremely large data 
sets to determine a relationship among samples [5]. Deep learning methods are promising, yet 
again, the off-line training can be quite time consuming and require many computational resources, 
and even the most state-of-the art deep learning techniques achieve recognition rates of 74% - 82% 
|16j . In the context of medical image interpretation, such resource requirements and performance 
cannot be tolerated. Due to the requirement of enormous amounts of data to take advantage of 
the statistical power of such methods, and due to the lack of accessibility of several large databases 
of medical images of the same type of anatomical structure, application of these approaches is 
infeasible. Without large networks of data readily available between medical image collaborators 
and machine learning experts, employing these learning techniques is difficult for the time being. 

Returning back to the underlying theme of this section of the text, aside from the medical image 
interpretation problem, another great example of computationally hard problems are video games, 
which are in fact deterministic computer simulations. Despite their novelty and being ridiculed 
as entertainment for children, understanding such interactive systems may play a major role in 
many aspects of developing automated human intelligence. After some finite number of attempts, 
a human player of a video game will converge to the solution of a particular puzzle, dungeon, level, 
or even complete the game. Yet, in recent years, researchers have demonstrated the algorithmic 
complexity of automatically playing such games. Automating a player that obeys the rules of 
simulated worlds of several video games and models of game play that have been around since the 
late 1980's is computationally intractable p!T| \2\ [35]. 
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Within the blink of an eye, a player must adjust their strategies and continue to play all while 
acquiring and processing several stimuli simultaneously. Similarly, while using semiautomated 
medical imaging tools, the operator must adapt, and proceed in a fashion that yields desirable 
results. By taking advantage of the human computer, the brain, and limiting the amount of 
interaction necessary, it is our belief that a human being can accurately and quickly extract features 
of interest from medical images. And upon proper feature extraction, any additional analysis can 
be fully automated. 

4.2 Automata are All Around Us and Also Within Us (Contribution 2) 

Indeed, automata are all around us and some within us. For example, anyone who has visited a 
supermarket is immediately greeted by an automaton upon entering. The little proximity sensor 
that resides above the entrance that detects when a person is standing near the doorway is a finite 
automaton that automatically controls the opening and closing of the sliding doors [31] . Although 
finite automata are simple reflex agents, when a system comprised of several automata interacting 
with one another is in motion, the system exhibits complex and even chaotic behavior. Cellular 
automata first introduced by Ulman, von Neumann, and Zuse in the 1950's are discrete grids with 
a finite automaton existing at each grid point, where each automaton's state is visible to itself and 
its neighbors [38j. 

In some cases, cellular automata can be viewed as discretization of partial differential equations 
in time. For example, lattice-gas cellular automata and lattice Boltzmann are cellular automata in 
applications of fluid and gas flows that have been used to solve and derive macroscopic properties 
of the Navier-Stokes equations and the discrete Boltzmann equation, respectively [37]. In addition 
to the behavior of fluids and gases, cellular automata appear in a wide array of places in nature. 
Cellular automata can be used to describe plant growth, plant branching patterns, growth of 
antlers in animals, shell geometry, shell visual pattern generation, various genetic programs (e.g. 
DNA replication), simple animal behaviors, animal pigmentation patterns (e.g. butterflies, flsh, 
snakes, tigers, and zebras), and even tissue growth (e.g. bones and tumors) [39] . 

Wolfram, one of the flrst authors to widely adopt and build upon cellular automata argues 
much like how the telescope helped shape astronomy, and how the microscope helped shape biology, 
computer technology will help shape engineering and mathematics [39]. We agree with this view, 
and would like to take it a step further, more speciflcally, automation and computation will indeed 
help shape biomedical image analysis, especially when the tool is used properly. As engineers and 
scientists, we must know the limitations of computational tools we brandish and develop intuition 
for recognizing when our tools can be naturally extended to solve additional problems. 

Following in the footsteps of Wolfram, our long term goal is to determine cellular automata 
rules that accurately generate vocal fold vibrations. We take the flrst steps towards this goal 
by deflning a non-context-free language that synthesizes magnitude of image gradients for normal 
vocal fold vibrations as would be derived from kymographic image data. Let 3 a flnite alphabet 
S = {0,n, A}, where 

O = {vocal folds opening / opened} 

□ = {vocal folds closing} 

A = {vocal folds in contact / closed} 

Now, loosely speaking, a non-context-free language that can be decided by a Turing machine in 
quadratic time that describes normal vocal-fold vibrations with respect to the information contained 
within temporally rich kymographic image sequences would be 
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^NCFL = (O^n^AO* n (D*^ A* O^r n (A^ O^' (5) 

where i,j,k > 1, cii « C2j ~ c^k, and ci,C2,C3 > 0. As seen in Figs. 5-9, using a subset of strings 
generated from the last term of Eq. (5), and a pseudo-random number generator, a kymographic 
images of healthy vibration for short vocal folds as taken at the middle kymographic scan line are 
synthesized. 
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Figure 5: Computer synthesized kymographic image gradient magnitude for short vocal folds ex- 
tracted at the middle scan line. This synthesized kymogram resembles a habitual phonation. 
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Figure 6: Computer synthesized kymographic image gradient magnitude for short vocal folds ex- 
tracted at the middle scan line. This synthesized kymogram resembles a high phonation. 




Figure 7: Computer synthesized kymographic image gradient magnitude for short vocal folds ex- 
tracted at the middle scan line. This synthesized kymogram resembles a breathy phonation. 
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Figure 8: Computer synthesized kymographic image gradient magnitude for short vocal folds ex- 
tracted at the middle scan line. This synthesized kymogram resembles a falsetto phonation. 
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Figure 9: Computer synthesized kymographic image gradient magnitude for short vocal folds ex- 
tracted at the middle scan line. This synthesized kymogram resembles a habitual phonation. 

Conjecture 4.1. 3 a 2D cellular automata that evolves in time capable of simulation normal vocal 
fold vibrations. 

Some classes of cellular automata are Turing- complete, which makes them capable of simulating 
Turing machine programs. Furthermore, some cellular automata are reversible. This has deep 
implications, given cellular automata rules that are representative of biomedical phenomena. It 
may be possible to construct analysis tools that incorporate these rules and automata to make 
more difficult problems not only manageable, but provide a unique solution for quantifying the 
phenomenon of interest. 

5 Conclusion 

In this work, we take the first steps towards developing automata rules for generating languages 
that explain physiological characteristics of vocal fold vibrations with respect to laryngeal imaging 
and its applications. It is our strong belief that not only do such approaches require continued 
investigation, but can be extended to other areas of physiological signal processing. Furthermore, 
we should be able to take advantage of individual finite automata that model a deterministic subset 
of the actions space of anatomical structures to later define cellular automata that evolve in time 
to study natural medical phenomena. 

We live in a universe were objects change according to patterns and rules, we call these rules the 
laws of nature [30] . Identifying such rules in the context of biomedical applications will allow us to 
establish computable languages capable of describing healthy behaviors of anatomical structures, 
and possibly better automated classification of pathological conditions. Before we can make the 
leap, we must harness satisfactory semiautomated analysis tools that have been decomposed in such 
a way that reduces human errors, and where human interaction is minimal and has been isolated 
to a very specific part of the framework. These achievements may yield better understanding of 
human perception, and to algorithms to simulate human intelligence. 
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